Learning to Resolve Natural Language

نویسنده

  • Dan Roth
چکیده

We analyze a few of the commonly used statistics based and machine learning algorithms for natural language disambiguation tasks and observe that they can be re-cast as learning linear separators in the feature space. Each of the methods makes a priori assumptions, which it employs, given the data, when searching for its hypothesis. Nevertheless, as we show, it searches a space that is as rich as the space of all linear separators. We use this to build an argument for a data driven approach which merely searches for a good linear sepa-rator in the feature space, without further assumptions on the domain or a speciic problem. We present such an approach-a sparse network of linear separators, utilizing the Winnow learning algorithm and show how to use it in a variety of ambiguity resolution problems. The learning approach presented is attribute-eecient and, therefore, appropriate for domains having very large number of attributes. In particular, we present an extensive experimental comparison of our approach with other methods on several well studied lexical disambiguation tasks such as context-sensitive spelling correction, prepositional phrase attachment and part of speech tagging. In all cases we show that our approach either outperforms other methods tried for these tasks or performs comparably to the best.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ensemble Learning for Low Resources Prepositional Phrase Attachment

Prepositional phrase attachment is a major disambiguation problem when it’s about parsing natural language, for many languages. In this paper a low resources policy is proposed using supervised machine learning algorithms in order to resolve the disambiguation problem of prepositional phrase attachment in Modern Greek. It is a first attempt to resolve prepositional phrase attachment in Modern G...

متن کامل

Interactively Picking Real-World Objects with Unconstrained Spoken Language Instructions

Comprehension of spoken natural language is an essential skill for robots to communicate with humans effectively. However, handling unconstrained spoken instructions is challenging due to (1) complex structures and the wide variety of expressions used in spoken language, and (2) inherent ambiguity of human instructions. In this paper, we propose the first comprehensive system for controlling ro...

متن کامل

Determination of Features for a Machine Learning Approach to Pronominal Anaphora Resolution in Basque

In this paper we present the preliminaries for a machine learning approach to resolve the pronominal anaphora in Basque language. In this work we determine the appropriate features to be used in this task.

متن کامل

Learning to Disambiguate Natural Language Using World Knowledge

We present a general framework and learning algorithm for the task of concept labeling: each word in a given sentence has to be tagged with the unique physical entity (e.g. person, object or location) or abstract concept it refers to. Our method allows both world knowledge and linguistic information to be used during learning and prediction. We show experimentally that we can handle natural lan...

متن کامل

Widdowson and Classroom Discourse

  Drawing on recent developments in linguistic description and applied linguistics, it can be concluded that learning a language necessitates getting to know something and being able to do something with that knowledge: competence, and performance. Structural approach to language description attaches importance to the former; communicative approach to the latter. Appropriate classroom discours...

متن کامل

To Appear Coling-acl '98: Workshop on Usage of Wordnet in Natural Language Processing Systems Incorporating Knowledge in Natural Language Learning: a Case Study

Incorporating external information during a learning process is expected to improve its eeciency. We study a method for incorporating noun-class information , in the context of learning to resolve Prepo-sitional Phrase Attachment (PPA) disambiguation. This is done within a recently introduced architecture , SNOW, a sparse network of threshold gates utilizing the Winnow learning algorithm. That ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998